@bvaradar
Contributor

Issue: Unlike the hoodie spark bundle, the Hoodie Utilities bundle does not include (and shade) the hive jars. I have made changes to fix that.
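
For readers unfamiliar with the mechanism, bundling and shading Hive jars is typically done with the `maven-shade-plugin`. The fragment below is only a rough sketch of that technique, not the actual change in this PR: the list of included Hive artifacts and the `example.shaded.` prefix are assumptions chosen for illustration.

```xml
<!-- Illustrative sketch only: the included artifacts and the
     "example.shaded." prefix are assumptions, not taken from this PR. -->
<plugin>
  <groupId>org.apache.maven.plugins</groupId>
  <artifactId>maven-shade-plugin</artifactId>
  <executions>
    <execution>
      <phase>package</phase>
      <goals>
        <goal>shade</goal>
      </goals>
      <configuration>
        <artifactSet>
          <includes>
            <!-- Pull the Hive client jars into the bundle jar -->
            <include>org.apache.hive:hive-jdbc</include>
            <include>org.apache.hive:hive-metastore</include>
          </includes>
        </artifactSet>
        <relocations>
          <!-- Rewrite package names so the bundled classes cannot
               clash with a different Hive version on the classpath -->
          <relocation>
            <pattern>org.apache.hive.jdbc.</pattern>
            <shadedPattern>example.shaded.org.apache.hive.jdbc.</shadedPattern>
          </relocation>
          <relocation>
            <pattern>org.apache.hadoop.hive.metastore.</pattern>
            <shadedPattern>example.shaded.org.apache.hadoop.hive.metastore.</shadedPattern>
          </relocation>
        </relocations>
      </configuration>
    </execution>
  </executions>
</plugin>
```

Relocation is what makes the bundled copy safe to ship: without it, the jars would merely be included, and the runtime's own Hive classes could still shadow them.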

I have also removed the profile settings and added docs showing users how to compile against the hadoop/hive versions used in their deployments. This turns out to be more reliable than making the Apache Hive-1.1.1 client work in a CDH runtime environment. With the profile settings removed, there is no reference to CDH in the Hoodie mvn code.

Testing:
Hoodie supports three types of environments (Apache Hive 2.x, Apache Hive 1.x, and CDH Hive). The maven profile settings change affects only the CDH version. I have tested hive sync against a CDH environment.

We are also currently testing Deltastreamer against a CDH environment to ensure it works end to end.

@bvaradar changed the title from "[WIP] Make HoodieDeltaStreamer hive-sync to CDH Hive Servers" to "Make HoodieDeltaStreamer hive-sync to CDH Hive Servers" on Feb 4, 2019
@bvaradar force-pushed the cdh_deltastreamer_sync branch from 86dfe6f to efedb63 on February 5, 2019 19:36
@bvaradar
Contributor Author

bvaradar commented Feb 6, 2019

@n3nash: Ready for review.

Member

@vinothchandar vinothchandar left a comment

Okay, makes sense. You just moved all the bundled dependencies from the profiles directly into the pom.

<artifactId>commons-configuration2</artifactId>
</dependency>

<dependency>
Member

Would the hive.version we use be compatible with Spark's bundled Hive, even without shading?

Contributor Author

Without shading, there is a compatibility issue with Spark's bundled Hive. So clients need to use hoodie-spark-bundle (instead of hoodie-spark).

Member

Is this needed even to use the DataSource API? Let's chat face to face.

Contributor Author

Yes. Sounds good.

@bvaradar
Contributor Author

@n3nash @vinothchandar: Can one of you merge this if it looks OK?

@n3nash
Contributor

n3nash commented Feb 14, 2019

LGTM, I can merge this @vinothchandar

@vinothchandar merged commit a16aa2a into apache:master on Feb 16, 2019